首页> 外文OA文献 >Word Searching in Scene Image and Video Frame in Multi-Script Scenario using Dynamic Shape Coding
【2h】

Word Searching in Scene Image and Video Frame in Multi-Script Scenario using Dynamic Shape Coding

机译:多脚本场景中的场景图像和视频帧中的单词搜索   使用动态形状编码

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Retrieval of text information from natural scene images and video frames is achallenging task due to its inherent problems like complex character shapes,low resolution, background noise, etc. Available OCR systems often fail toretrieve such information in scene/video frames. Keyword spotting, analternative way to retrieve information, performs efficient text searching insuch scenarios. However, current word spotting techniques in scene/video imagesare script-specific and they are mainly developed for Latin script. This paperpresents a novel word spotting framework using dynamic shape coding for textretrieval in natural scene image and video frames. The framework is designed tosearch query keyword from multiple scripts with the help of on-the-flyscript-wise keyword generation for the corresponding script. We have used atwo-stage word spotting approach using Hidden Markov Model (HMM) to detect thetranslated keyword in a given text line by identifying the script of the line.A novel unsupervised dynamic shape coding based scheme has been used to groupsimilar shape characters to avoid confusion and to improve text alignment.Next, the hypotheses locations are verified to improve retrieval performance.To evaluate the proposed system for searching keyword from natural scene imageand video frames, we have considered two popular Indic scripts such as Bangla(Bengali) and Devanagari along with English. Inspired by the zone-wiserecognition approach in Indic scripts[1], zone-wise text information has beenused to improve the traditional word spotting performance in Indic scripts. Forour experiment, a dataset consisting of images of different scenes and videoframes of English, Bangla and Devanagari scripts were considered. The resultsobtained showed the effectiveness of our proposed word spotting approach.
机译:由于其固有的问题,例如复杂的字符形状,低分辨率,背景噪声等,从自然场景图像和视频帧中检索文本信息是一项艰巨的任务。可用的OCR系统通常无法在场景/视频帧中检索此类信息。在这种情况下,关键字搜寻是一种替代性的信息检索方法,可以有效地进行文本搜索。但是,当前场景/视频图像中的单词发现技术是特定于脚本的,并且它们主要是针对拉丁脚本开发的。本文提出了一种利用动态形状编码在自然场景图像和视频帧中进行文本检索的新颖单词发现框架。该框架旨在借助相应脚本的实时脚本生成关键字,从多个脚本中搜索查询关键字。我们采用了基于隐马尔可夫模型(HMM)的两阶段词点识别方法,通过识别行的脚本来检测给定文本行中的翻译关键词。一种新颖的基于无监督动态形状编码的方案已将相似的形状字符分组以避免接下来,验证假设位置以提高检索性能。为了评估从自然场景图像和视频帧中搜索关键字的建议系统,我们考虑了两种流行的印度文字,例如Bangla(Bengali)和Devanagari用英语。受印度脚本中区域明智识别方法的启发[1],区域文本信息已被用来改善印度脚本中传统的单词发现性能。对于我们的实验,考虑了一个由不同场景的图像和英语,孟加拉语和梵文的脚本构成的数据集。获得的结果表明了我们提出的单词发现方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号